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Searches for known waveforms in gravitational wave detector data are often done using matched 
filtering. When used on real instrumental data, matched filtering often does not perform as well as 
might be expected, because non-stationary and non-Gaussian detector noise produces large spurious 
filter outputs (events). This paper describes a \ 2 time-frequency test which is one way to discrimi- 
nate such spurious events from the events that would be produced by genuine signals. The method 
works well only for broad-band signals. The case where the filter template does not exactly match 
the signal waveform is also considered, and upper bounds are found for the expected value of \ 2 - 
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I. INTRODUCTION 

Matched filtering is a common and effective technique 
used to search for signals with a known waveform in a 
data stream 0. The output of a matched filter will be 
large if the data stream contains the desired signal. But 
it can also be driven to large values by spurious noise. 
This paper describes a x 2 time-frequency discriminator 
statistic which has proven effective at distinguishing be- 
tween these two possibilities. 

The method was invented by the author in 1996 for 
use on data from the LIGO 40m prototype gravitational 
wave detector Q . The method was subsequently used in 
the analysis of data from the Japanese TAMA detector 
[J IE IE S Hi an d the first analysis of science data from 
the full-scale LIGO detectors It has also been used 
in p reliminary searches using VIRGO engineering data 
[Tc( and GEO-600 data Until now the only detailed 
description of the method was in the documentation for 
the GRASP software package Q, as referenced in the 
publications above. This paper describes the method in 
more detail, and analyzes its properties. 

The x 2 time-frequency discriminator is designed for 
use with broad-band signals and detectors. The essence 
of the test is to "break up" the instrument's bandwidth 
into several smaller bands, and to see if the response in 
each band is consistent with what would be expected 
from the purported signal. This method can only be 
used to discriminate signals for which the gravitational 
waveform is known, meaning that it can be calculated in 
advance, with high precision. (Note: the word "known" 
is slightly misleading since the waveform typically still 
depends upon a few unknown parameters, such as the 
overall scale and initial phase.) 

A new generation of broadband gravitational wave de- 
tectors is now undergoing commissioning 12] and more 
sensitive instruments are in the planning and design 
stages [l3l Il4j . We expect that this test will prove useful 
for those instruments as well. 

In searching for signals and setting upper limits, the 
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primary use of the x 2 time-frequency discriminator is 
as a veto. This means that events which might other- 
wise be used, analyzed or studied in more detail are re- 
jected because they have a x 2 value which is too large. 
In general terms, the x 2 time-frequency discriminator 
may be thought of as a method for reducing contri- 
butions from the non-Gaussian tails that characterizes 
most gravitational- wave detectors ^3|. Substantial ef- 
forts have been made to characterize these tails in the 
TAMA HEIIl, Explorer [l^, and Nautilus [3 detec- 
tors. Other methods for reducing the effects of these 
tails have also been proposed and/or used. For example 
Creighton [lj| proposes a simple analytic model for non- 
Gaussian tails and uses it to characterize a network detec- 
tion algorithm which is insensitive to this non-Gaussian 
tail. Other filtering methods, based on locally-optimal 
statistics which are less sensitive or insensitive to non- 
Gaussian tails have been proposed |20l |21| for matched 
filtering and stochastic background searches. Shawhan 
and Ochsner [2^] have developed a heuristic veto method 
for matched filtering for binary inspiral, based on count- 
ing threshold crossings in a short-time window. When 
tuned for the LIGO SI data set with a half-second win- 
dow, the method is effective, and provides a veto which 
is complementary to the % 2 time-frequency discrimina- 
tor presented here. Some related ideas have also been 
explored by Guidi J2^ ■ 

The principal source that will serve as an example here 
is the gravitational radiation back-reaction driven inspi- 
ral of pairs of compact stars, also known as "binary inspi- 
ral" . If each of the two stars (typically neutron stars or 
black holes) has masses smaller than a few solar masses, 
then the waveforms can be accurately calculated over 
the typical detector bandwidth (30-500 Hz) using post- 
Newtonian approximations pi l2fjl El El El El I30L El 
I32I l33| . In this case, the unknown signal parameters in- 
clude an overall amplitude scale, the masses and spins 
of the two stars, a fiducial reference time (often taken to 
be the "coalescence time" ) , and the initial phase of the 
orbit. (The final unknown parameter, the orbital incli- 
nation, is degenerate with these other parameters, and 
may therefore be ignored. The orbital eccentricity may 
also be neglected: by the time such systems are emitting 
in the detection band, they have radiated away any ec- 
centricity and the orbit has circularized.) These signals 



are broadband, since the binary system is observed at the 
very end of its life when the signal frequency is increas- 
ing rapidly as the orbital period decreases and the stars 
spiral together. 

The paper is structured as follows. Section HP defines 
the notational conventions that are used. Section [ffl] de- 
rives the form of the optimal matched filter in the sim- 
plest case and describes its properties. Section Hvl defines 
the x 2 time-frequency discriminator and derives its basic 
properties, also for the simplest case. Section [V] gives 
a brief illustrative example of this statistical test in ac- 
tion, computing and comparing the \ 2 values obtained 
for a simulated inspiral signal and a spurious noise event. 
Readers interested in acquiring a quick understanding of 
the method without seeing the technical details should 
start with this Section. 

A significant problem in searching for gravitational 
wave signals is that the waveform depends upon a num- 
ber of parameters (for example, masses) of the source. 
For this reason, data must be searched with a bank of 
filters designed to cover this parameter space [2 
Since this bank is discrete, and the source parameters are 
continuous, the match between signal and filter is never 
exact. The effects of this signal/template mismatch on 
the \ 2 discriminator are investigated and quantified in 
Section IVll One of the main results of this paper is an 
absolute upper limit on the expected value of x 2 arising 
from template/signal mismatch. 

Up to this point, the signals studied are of the simplest 
type, which can be completely described with only two 
parameters: an overall amplitude, and an offset/arrival 
time. However this is insufficient for most cases of inter- 
est, where the signals are an (a-priori unknown) linear 
combination of two different polarizations. Section IVIII 
treats this case, deriving two-phase results analogous to 
the single-phase results of the previous Sections. 

Section IVIIII examines suitable thresholds on \ 2 f° r 
stationary Gaussian noise, and contrasts these with the 
heuristic thresholds used in published analysis of real de- 
tector data such as the LIGO SI binary inspiral upper 
limit analysis [jj. 

Section IIXI examines a variation of the discriminator 
based on "unequal expected SNR" intervals, and shows 
that this discriminator still has most of the properties of 
the x 2 discriminator defined in previous Sections. . 

There are an infinity of possible x 2 -like statistical tests 
and discriminators. Work by Baggio et al. 36] intro- 
duced a x 2 test f° r use with resonant-mass gravitational 
wave detectors. In Section^ the x 2 time-frequency dis- 
criminator of this paper is compared to that test. While 
the tests share some similar features, they have quite dif- 
ferent properties and behavior. 

This is followed by a brief Conclusion, which summa- 
rizes the main results and some of the unanswered ques- 
tions. 

Appendix lAl contains a short calculation proving that 
the time-frequency discriminator defined in this paper 
has a classical x 2 distribution if the detector's noise is 



Gaussian. Appendix [B] derives a simple mathematical 
result used in the body of the paper, 

II. CONVENTIONS 

The Fourier Transform of a function of time V(t) is 
denoted by V(f) and is given by 

V(f) = J e- 2nift V(t)dt. (2.1) 

The inverse Fourier transform is 

V(t) = J e^Viftdf. (2.2) 

All integrals are from — oo to oo, unless otherwise indi- 
cated, and * denotes complex conjugate. 

The detector output (typically a strain) is denoted by 

s(t) = n(t) + h(t) (2.3) 

where n(t) is the (real) strain-equivalent noise produced 
by fluctuations within the detector and its environment, 
and h(t) is a (real) gravitational waveform of astrophys- 
ical origin. 

Since the detector's noise n(t) can only be character- 
ized statistically, one must introduce tools for determin- 
ing the expected properties of quantities measured in the 
presence of this noise. There are several equivalent ways 
to do this. In this paper, we imagine that n(t) is a ran- 
dom time-series drawn from a large ensemble of such time 
series, whose statistical properties are those of the instru- 
ment noise |37|. If W is some functional that depends 
upon n(t), then angle brackets (W) denote the average 
of W over the ensemble of different n{t). 

We assume that (n(t)) vanishes, which implies that 
(n(/)) = 0. We also assume that the statistical prop- 
erties of the detector's noise are second-order stationary 
|44| which implies that the expectation value (n(t)n(t')} 
depends only upon the time difference t — t' . It then 
follows that in frequency space 

(n(f)n*(f))=S n (f)S(f-n (2-4) 

where 5(f) is the Dirac delta- function. The two-sided 
noise power spectrum is a real non-negative even function 
S n (f) which from (|2.4() can be shown to satisfy 

(n 2 (t)) = J S n (f) df. (2.5) 

This implies that 2S n (f)df may be interpreted as the ex- 
pected squared strain in the frequency band from / to 
/ + df, for / > 0. (Note that much of the literature 
on this subject, including publications of the LIGO Sci- 
entific Collaboration, uses a one-sided power spectrum 
2S n (f), because this is typically the quantity measured 
by standard instrumentation. Its use here would compli- 
cate many formulae with extraneous factors of two.) 
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As explained earlier, we are interested in the case of 
a known waveform. A prototypical example is a binary 
inspiral chirp waveform, which may be written as |27| 

h{t) = — (cos <f>T c (t- to) + sin <pT 8 (t- t )). (2.6) 

This waveform depends upon three nuisance parameters. 
These are the effective distance d to the source, a fiducial 
time to (for example the coalescence time of the binary 
pair) and a phase </> which is determined by the orbital 
phase of the binary pair and its orientation relative to 
the detector. 

The templates T c and T s are the signal waveforms that 
would be produced by a binary inspiral pair optimally 
oriented with respect to the detector, at distance D, in 
the two possible polarization states (corresponding to ro- 
tating the detector axes by 45°). The waveform may de- 
pend upon additional nuisance parameters; we will return 
to this later. 

For pedagogic purposes, we first consider the simpler 
case in which the phase <j) of the waveform is known a 
priori, in advance, 



h(t) = ^T(t-t a ) 
a 



(2.7) 



In this case there are only two nuisance parameters: time 
of arrival and effective distance d. 

The quantity D is the canonical distance at which a 
source, optimally-oriented with respect to the detector, 
would produce the waveform T. Its value determines the 
overall normalization scale of the waveform T, since, for 
a given source type, the quantity DT(t) is independent of 
D, and is determined by the laws of physics, specifically 
General Relativity. 



III. MATCHED FILTERING 

A matched filter is the optimal linear filter for detection 
of a particular waveform. Its form can be derived using 
a number of different techniques. Here we use one of the 
classical signal analysis methods. 

For notational purposes it is useful to introduce a Her- 
mitian inner product defined by 



{A,B)=J 



A*(f)B(f) 

Sn(f) 



df, 



(3.1) 



for any pair of complex functions A(f) and B(f). The 
frequency dependence of A and B will usually be implied 
and not indicated explicitly. 

A real detector functions only over a finite frequency 
band, and acquires data at a finite sample rate. In this 
case, the noise power spectrum S n may be taken to be 
infinite outside the bandwidth of the instrument, effec- 
tively restricting the range of integration to lie between 
plus and minus the Nyquist frequency /jy = 1/(2 At), 
where At is the time between successive data samples. 



The matched filter is a linear operator that maximizes 
the ratio of "signal" to "noise" . We denote the filter by 
Q*{f)/S„(f) and the output of the filter by z, so 

z = J ^- ur df = (Q,s). (3.2) 

We require that z be real, which implies that Q(f) = 
Q*{—f), and also means that Q(f)/S n (f) corresponds 
to a real function (filter kernel) in the time domain. 

The expected value of z can be found from l|2.7|l . and 
is given by 



D 
1 



f e -27ri/i 



(3.3) 



Here T(f) denotes the Fourier transform of T(t); the 
translation in time by to is explicitly encoded in the ex- 
ponential term. Note that in the absence of a source 
(d — > oo) the expected value of z vanishes since (n(f)} = 
implies that (s(/)) = 0. 

The expected value of the square of z may be found 
using l|2.4|l 



(3.4) 



To estimate the error or uncertainty in a measurement of 
z, it is useful to define 



6z 



(z). 



(3.5) 



The error or uncertainty in a measurement of z, due to 
noise in the detector, is 



^5zY) = ^{{z-{z)f) 



\*~) - \ z ) 



(Q,Q) 



(3.6) 



From these quantities, we can now derive the properties 
of the optimal matched filter. 

Under the assumptions that we have made about the 
detector output (|2.3|l the optimal choice of matched fil- 
ter Q is the one that maximizes the ratio of the expected 
filter output < z > given by (|3.3|) to its expected un- 
certainty H3.6fl due to detector noise. Hence the optimal 
choice of Q maximizes 



(Q,A) 



VPF) (Q,Q) 1/2 ' 

where we have defined 

d 



(3.7) 



(3.8) 



Since the inner product is Hcrmitian, Schwartz's inequal- 
ity states that 



\(Q,A)\ 2 <(A,A)(Q,Q). 



(3.9) 
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The two sides are equal if and only if Q is proportional 
to A. Hence the ratio . T|) is maximized when Q{f) is 
proportional to A{f). Thus, the optimal filter is a time- 
reversed copy of the template, weighted by the expected 
noise in the detector |45j . 

The Signal to Noise Ratio (SNR) is defined by the ra- 
tio of the observed filter output z to its (expected or ob- 
served) root-mean-square fluctuations 



SNR 



(Q,s) 
(Q,Q) 



(3.10) 



and is independent of the normalization of the optimal 
filter Q. By definition, in the absence of a signal (SNR) = 
and ((SNR) 2 ) = 1. 

It is convenient to choose the normalization of the op- 
timal filter Q so that [Q,Q) = 1. This may be achieved 
by choosing the filter Q to be 



Q(/) = (f,f) 



~ 1/2 f(f) e" 



2mft Q 



(3.11) 



With this normalization choice, the filter output z is 
equal to the SNR. Henceforth we will use z to denote 
both of these quantities. 

While the optimal filter Q is explicitly independent 
of the normalization scale of the template T, we showed 
earlier that the scales of D and T could be freely adjusted 
provided that their product DT was held fixed. For the 
purposes of interpreting the SNR z, it is convenient to 
set the distance scale D so that 



(f,f) = l. 



(3.12) 



With this choice of normalization, the expected value of 
the SNR is 



(3.13) 



This choice of normalization is thus equivalent to choos- 
ing the distance D at which the template is defined to be 
the distance at which an optimally-oriented source would 
have an expected SNR of unity: (z) = 1. 

Since the expected value of z is proportional to the 
inverse distance, one may use the actual measured value 
of z to estimate the distance. Since the actual measured 
value of z is affected by instrument noise, this estima- 
tor has some average error. One can easily estimate the 
error, since with our normalization choices 



{z z ) = 1 + I — 1 , and hence 



((6zf) = 1. 



(3.14) 



This means that the expected fractional error in estimat- 
ing the inverse distance to the source is 



((6z) 



2X1/2 



1 

V) 



d 

D 



(3.15) 



Thus, a measured SNR of z = 10 implies a fractional 
accuracy in distance determination of about 10%. 

Up to this point, we have been assuming that the 
fiducial coalescence time to is known. In practice, one 
searches a data stream for statistically significant values 
of z, for all possible choices of to. As a function of to the 
SNR is 



z{to) 



Hf)f* 



a 2iri/t 



Sn(f) 



df. 



(3.16) 



Because this is just an inverse Fourier transform, it is 
both practical and simple to compute this quantity from 
a data stream s(t). For example the Fast Fourier Trans- 
form (FFT) algorithm allows the r.h.s. to be computed 
in order NhiN operations, where N is the number of 
data samples of h in the time or frequency domain. 



IV. THE x 2 DISCRIMINATOR TEST 

In the previous section, we assumed only that the de- 
tector noise was second-order stationary. It is quite com- 
mon in such studies to also assume that the noise is Gaus- 
sian. One may then show that the probability of the 
SNR exceeding some threshold falls exponentially with 
increasing threshold, and so large values of the SNR have 
low probability of being due to noise in the detector, and 
thus are a good indication that a real source is present. 

Unfortunately, experience has shown that the noise in 
broadband gravitational wave detectors is far from Gaus- 
sian. Typically it has a Gaussian or Gaussian-like com- 
ponent (arising from electrical, thermal and shot noise) 
but there is another "glitchy" component that could be 
described as Poisson-like impulse noise. There are many 
sources of this noise, including marginally stable servo 
systems and environmental anomalies. The effects of 
this noise on the filtering process described above can 
be quite dramatic. Whereas the matched filter is de- 
signed to give a large response when the signal waveform 
matches the template, it also can give a large response 
when the instrumental noise has a large glitch. Although 
the waveform of this glitch looks nothing like the tem- 
plate, it nevertheless drives the filter output to a large 
value. 

The statistical test described here provides a way to 
determine if the output of the filter is consistent with 
what might be expected from a signal that matched the 
template. To be effective, both the signal and the detec- 
tor must be broadband. 

One way to understand how this test works is to imag- 
ine that instead of a single broadband detector, one 
is given p data streams from p different, independent 
narrow-band detectors, each operating in a different fre- 
quency band. For each detector, one can construct an 
optimal filter for the signal, and then one can ask if the 
results are consistent, for example, if the p (potentially 
different) fiducial times to which maximize the output of 
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FIG. 1: A typical set of frequency intervals Afj for the case 
p = 4. These intervals are narrowest where the detector is the 
most sensitive, and broadest where it is least sensitive. 



each of the p independent detectors are consistent with 
a single value. 

Begin by assuming that, using matched filtering as pre- 
viously described, we have identified a time of arrival to 
and inverse distance D/d. The goal is to construct a 
statistic which indicates if the filter output is consistent 
with this signal. 

We will do this by investigating the way in which z(to) 
gets its contribution from different ranges of frequencies. 
To do this, we partition the frequency range / € [0, oo) 
into a set of p distinct subintervals A/i , • • • , Af p whose 
union is [0, oo). The frequency intervals: 

A/i = {/ | < / < /J 

Ah = {/ 1 h < f < h] 

Afp-! = {/ I fp-2 <f< fp-l} 

Afp = {/ I U-i < f < oo}, 



(4.1) 



will be defined by the condition that the expected signal 
contributions in each frequency band from a chirp are 
equal. (Note that one may also pick intervals which do 
not satisfy this condition. In Section IIXI we show that 
when suitably defined, the resulting statistic still has a 
classical x 2 distribution for the case of Gaussian detector 
noise.) 

To define the frequency bands, it is helpful to introduce 
a set of p Hcrmitian inner products (for j = 1, • • • ,p) 
defined in analogy to l|3.1[) by 



(A B).=J_ 



A*(f)B(f) 

Sn(f) 



df. 



(4.2) 



In each of these integrals, the range of integration is over 
both the positive and negative frequencies. As discussed 
following (|3.1|) . since S n (f) may be taken as infinite for 
|/| greater than the Nyquist frequency /at, the effec- 
tive upper limit of the final frequency interval Af p is 
/jv rather than oo. 

Since the frequency intervals don't overlap, but cover 
all frequency values, the sum of these inner products 



(A,B)=j2(A,B). 



(4.3) 



yields the inner product l|3.1|l defined earlier. The p dis- 
tinct frequency bands are uniquely determined by the 
condition that 



choose Afj so that (f, f)j 



1 



(4.4) 



A typical set of frequency intervals in shown in Figure^ 
For given instrumental noise S n (/) the frequency inter- 
vals Afj depend upon the template T. However it may 
be the case that many templates actually share the same 
frequency intervals Afj . A good example of this is the 
set of stationary-phase post-Newtonian templates, where 
the amplitude is calculated in the first post-Newtonian 
approximation, and the phase is calculated to higher or- 
der [3l,[3!|. For these templates, the frequency intervals 
are determined by 



(T,T)j 
/ -7/3 



Af } S n (f) 



df = - 



i(f,f) 
p 



1 

P Jo 



(4.5) 



r 7/3 

s„(f) 



df 



provided that mi and mi lie in a range for which the 
stationary phase approximation holds within the detector 
band ^(|. For this family of templates, all the templates 
share the same bands Afj. 

The SNR (|3.2fl is an integral over all frequencies, and 
can be written as a sum of contributions from each of the 
p different bands, 



p 



with Zj = (Q, s)j 



(4.6) 



The expected values of Zj and its square are computed 
using the same techniques as before, and give 



(Zj) = - — , and 
p d 

< Z I> = - + — b" 

J p p z a 
In the absence of a signal (take d — * oo) one finds 

{ Zj ) = and (z 2 ) - 1. 



(4.7) 



(4.8) 



This suggests an obvious statistical test to see if the signal 
is consistent with the model. 

Consider the p quantities defined by 



A 

Azj = z.j . 

P 



(4.9) 



These are the differences between the SNR in the band 
Afj, and the SNR that would be anticipated 0| in that 
band, based on the total measured SNR in all bands. By 
definition, these differences sum to zero 

n 
3=1 

and their individual expectation values vanish 

(Azj) = 0. (4.11) 
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To calculate the expectation values of their squares, first 
note that the quantity (zjz) must, by symmetry, be j- 



independent 



Since the sum over j of (zjz) yields 



-r 



one must have 



( z j z ) 



1+ -T 



(f) = 1 
P P 

Thus the expectation value of the square of Azj is 

Z N 2, 



(4.12) 



2(z,-z> 



1 - 



(4.13) 



Notice that these quantities do not depend upon d. 
In fact these quantities, and their second-order statistical 
properties, are independent of whether or not a signal is 
present. This motivates the definition of a discrimination 
statistic. 

We define the x 2 time- frequency discriminator statistic 
by 



X 2 = X 2 {z x , ■ ■ ■ ,z p ) =p^(Azj) 2 

3=1 



(4.14) 



This choice of statistic is one of the main results of the 
paper: in what follows we will study its properties in 
detail. 

It follows immediately from l|4.13[l that the expected 
value of x 2 is 

( X 2 )=p-l (4.15) 

Up to this point, the only assumption we have made is 
that the noise in the instrument is second order station- 
ary, specifically that (n(t)n(t')) depends only upon t — t'. 
To further analyze the properties of this statistic, addi- 
tional assumptions are needed. 

In the design of signal processing algorithms, it is com- 
mon to analyze the performance of the method in the case 
where the instrument noise is both stationary and Gaus- 
sian. In this case, Appendix ^ shows that probability 
distribution function of x 2 is a a classical ^-distribution 
with p — 1 degrees of freedom. The (cumulative) proba- 
bility that x 2 < Xo is 



x^<x 



3 

"2e 



r(f-|) 



du 



7(f 



2 - 



r(f 



I) 



(4.16) 
(4.17) 



where 7 is the incomplete gamma function. In this case, 
where the noise is assumed to be stationary and Gaus- 
sian, the expected distribution of x 2 values is quite nar- 
row. One has 



2\2\ 



P 



(4.18) 



which implies that the "width" of the x 2 distribution is 

(((x 2 ) 2 ) - <x 2 ) 2 ) 172 = V^T) 



(4.19) 



Thus, if the noise were stationary and Gaussian, we 
would expect to find x 2 values in the range [p — 1 — 
\/2(p — 1), p — 1 + \/2{p — 1)] . Since the fractional width 
of this range decrease with increasing p, one might expect 
that large values of p are desirable, since they appear to 
give high discriminating power. 

Practice and experience have shown that large values 
of p do not, in fact, work very well |11|. Partly this is 
because the detector noise is neither stationary nor Gaus- 
sian, and partly this is because the signal is not a perfect 
match to the template. Large values of p tend to spread 
non-stationary glitch noise over many frequency bands, 
diluting its effect on x 2 - This is difficult or impossible 
to model analytically, and can best be understood (as in 
[Tl|) by Monte-Carlo studies of simulated signals added 
into real detector noise. However the effects of a signal- 
template mismatch can be studied analytically; this is 
done in Section IvH 



V. HOW DOES THE x TEST WORK? 

The x 2 test was invented based on experience filtering 
data from the LIGO 40m prototype instrument 0, 0- 
It was observed that a binary inspiral filter bank regis- 
tered many events that (when converted to audio) did 
not sound like inspiral signals. In particular, the low fre- 
quency component of the signal did not arrive first, fol- 
lowed by the midrange and high frequency components. 
The x 2 test first arose from considering a set of matched 
filters in different bands, and testing to see if the filter 
outputs all peaked at the correct time. The signal z\ was 
constructed from the lowest frequency band, Z2 from the 
next frequency band, and so on. This is illustrated in 
Figure |21 for a single-phase test with p — 4 bands. 

It is instructive to compare the values of the filter out- 
puts (single-phase test) for the two cases shown in Figure 
[2] For the simulated chirp, the signal-to-noise ratio was 
z = 9.2 and the signal values in the different bands were 



Z\ 


= 2.25 




Z2 


= 2.44 




Z3 


= 1.87 




z± 


= 2.64 




z 


= Z\ + z 2 H 


- z 3 + z 4 = 9.2 


x 2 


= 4l> 


- z/4) 2 = 1.296 



3=1 



Px 2 > 1-296 



7(3/2,0.648) 
T(3/2) 



73%. 



This is quite consistent with the value of x 2 that would 
be expected for a chirp signal in additive Gaussian noise. 
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Simulated Chirp (SNR = 9.2) 



Spurious Event (SNR = 8.7) 



High freq filter 
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t = t„ 



t=t„ 



FIG. 2: The output of p = 4 single-phase filters for a simu- 
lated chirp signal added into a stream of detector noise (left 
set of figures) and a transient burst present in detector noise 
(right set of figures). For the simulated chirp, the filters in 
the different frequency bands all peak at the same time offset 
to: the time offset which maximizes the SNR. At this instant 
in time, all of the Zj are about the same value. However when 
the filter was triggered by the transient burst, the filters in the 
different frequency bands peak at different times. At time to 
they have very different values (some large, some small, and 
so on). 



For the spurious noise event shown in Figure [21 the 
SNR z — 8.97 was quite similar but the value of \ 2 is 
very different: 



Z\ 


= 0.23 




Zl 


= 0.84 




Z3 


= 5.57 




z 4 


= 2.33 




z 


= Zl + z 2 - 


- z 3 + z 4 = 8.97 


x 2 


= 4 I> 


- z/Af = 68.4 



(5.2) 



V>68.4 = 1 m/2) 



7(3/2,34.2) 
1 - = 9.4 x 10~ 15 . 



The probability that this value of \ 2 would be obtained 
for a chirp signal in additive Gaussian noise is extremely 
small. 



VI. EFFECT OF A SIGNAL/TEMPLATE 
MISMATCH ON X 2 

In the previous two Sections, we analyzed an optimal 
filter and constructed a \ 2 statistic for the case where 
the signal waveform was known exactly. In practice, this 



is not possible. Typically, signal waveforms come from a 
family characterized by a set of continuous parameters, 
such as masses and spins. Thus, in practice, to search 
for signals one uses a discrete set of templates, called a 
template bank 34, 35] . Such banks can contain anywhere 
from dozens to hundreds of thousands of templates. Since 
each template in the bank is defined by a point in parame- 
ter space, the template bank may be thought of as a grid, 
or mesh, in parameter space. Typically, this grid is laid 
out to ensure that any signal from the continuous family 
of waveforms is "near" some point in the grid. In this 
section, we analyze the case where the signal waveform 
is "close" to the template waveform, but not a perfect 
match. 

We begin by assuming that the signal is perfectly de- 
scribed by a template T", so that the detector's output 
is 



s(t) = n(t) + ^T'(t). 



(6.1) 



Adopting the same conventions as before, we assume that 
D' is chosen so that T' obeys (T", T") = 1. For simplicity, 
and without loss of generality, we take to = 0. Assume 
that this signal is "close" to that of the template T, and 
hence that the signal is detected in that template. The 
SNR is 

z=(Q,s) = (f,h) + ^(f,f). (6.2) 

Using Schwartz's inequality, the inner product between 
the two templates must lie in the range [— 1, 1]. 



(f,f'f < (f,f)(f' 1 f l ) 
< 1. 



(6.3) 



One may think of the two templates as unit vectors sep- 
arated by an angle 6 and write this in the form 



(T,T') =cos0, for 9 e [0,tt]. 



(6.4) 



This inner product is often called the fitting factor. The 
expected value of the SNR 



/ \ D ' 



(6.5) 



is reduced by a factor of the fitting factor compared with 
the expected SNR D'/d' that would be obtained if the 
template bank contained the perfectly matching template 
V. 

The fractional difference between this "ideal case" ex- 
pected SNR and the expected SNR in the mismatched 
template is called the template mismatch e 



1 



(6.6) 



The value of e must lie in the range e S [0, 2], and may 
be restricted to the range e £ [0, 1] by changing the sign 
of T' if needed. Hence, without loss of generality we will 
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assume that < cos 9 < 1 and that < e < 1 . The case 
of most interest is when e<l. Typically template banks 
are set up so that the worst-case mismatch corresponds to 
a a loss of event rate (for a uniform source distribution) of 
10%. Since the volume inside a sphere of radius r grows 
proportional to r 3 and the SNR is inversely proportional 
to distance, this corresponds to a typical worst-case tem- 
plate mismatch of e = 0.033 = 3.3%. 

Following the same procedures as in Section II I II one 
can find the expected SNR squared, which is 



cos 



(6.7) 



Thus, the first- and second-order statistics of z are in- 
distinguishable from those that would be produced by a 
signal from a perfectly matched template T with SNR 
D'cosO/d'. 

To analyze the effects of the signal/template mismatch 
on the x 2 statistic is slightly more involved. We begin 
by considering the way in which the templates overlap 
in each individual frequency band. Define a set of p real 
constants Ai, • • • , X p by 



(T,T'). = Xjcost 



(6.8) 



It follows from l|6.4f> that these constants sum to unity, 



3=1 



(6.9) 



The average value of the Xj is 1 /p. The deviation away 
from this value is a measure of how close together (or far 
apart) the templates T and T" are in the frequency band 

The goal is to understand how the % 2 statistic is af- 
fected by signal/template mismatch. To determine this, 
we first express the SNR in the j'th band as 

Zj = (Q,s)j = {f,n)j + ^(f,f')j (6.10) 



D' 

[T, n)j + —Xj cost 



(6.11) 



Using calculations identical to Section IIVI the expected 
value of the SNR and its square in the j'th band are 



D' 



(zj) = —XjCosQ, and 



(6.12) 
(6.13) 



d' 

j p 

As before, we define Azj = Zj — z/p, giving 

A Zj = (f ,n), - ±(T,n) + ^{Xj - i)cos0. (6.14) 

The first difference between this analysis and the one for 
matching templates is that in the mismatched case, the 
expectation value of Azj does not vanish: 



(Az j ) = ^{X j -±)co S 6. 



(6.15) 



As before, we can use the assumption that the detector 
noise is second-order stationary to calculate 



D' 



cost 



= ([(r,ft)i + y A 

:[(!» + — cosf 



(T,f) j + (Jp) X jC os 2 e 

1 fD'\ 2 

A.cos^. 



(6.16) 



Using these results, it is straightforward to work out the 
expectation value of (Azj) 2 : 



((AzjY) = {z-) + 



2, , {Z 2 ) 1( ZJ Z) 



P 



(6.17) 



The expectation value of the \ 2 discriminator statistic 
(|4.14l) is therefore 



(x 2 > 



3=1 



l + {z) 2 ±p{X 3 -\) 



(6.18) 



3=1 



This is in sharp contrast to the case where the signal 
matched the template perfectly. In that case, the expec- 
tation value of x 2 was independent of the signal strength. 
Here, when the signal and template do not match per- 
fectly, the expected value of x 2 depends quadratically on 
the expected SNR (z) of the signal. 

The dependence of the discriminator (x 2 ) on the 
square of the expected SNR (z) 2 has a coefficient 



3 = 1 



P 



(6.19) 



3=1 



The quantity k is manifestly non-negative; we now ob- 
tain an absolute upper bound on its value. Schwartz's 
inequality implies that 

(f,f'f. < (f,f).(f',f'). 

X 2 cos 2 8 < -(f',f). 

3 — p y ' '3 



— IT' T') 



pcos 

Summing both sides over j one obtains 

p 



£a 

3=1 



1 

J ~ p COS 2 I 



(6.20) 



(6.21) 
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Combining this with the definition 
tains the bound 



< k < 



- 1. 



cos z 



191) of k, one ob- 



(6.22) 



Note that this relationship does not assume any relation- 
ship between the signal waveforms T and T' . 

For the case of most interest (small template mismatch 
e -C 1) this yields the bound 

< k < 2e (6.23) 

and hence one of the main results of this paper |49j 

(X 2 ) = p - 1 + n(z) 2 with < k < 2e. (6.24) 

This relationship only assumes that the fitting factor be- 
tween the signal waveforms T and T' is close to one. 

We can obtain a different and tighter bound on k if 
we assume that the frequency bands defined by (|4. 1|> 
and (|4.4(l are the same for the waveforms T and T'. 
For example, as discussed earlier in the context of equa- 
tion (|4.5|l . this is true for binary inspiral waveforms in 
the stationary-phase approximation. In this case, since 
(f',f")j = 1/p, equation (|6~2TJ|) implies that 



Thus one has 



pcos 



X 2 j cos 2 9 < 



1 , 1 

< A, < 



pcost 



(6.25) 



(6.26) 



It is convenient to define ujj = l/p — Xj. In terms of these 
quantities 



o 

■4 — ' 

CO 



\ e=2/p 


K<2e 


k < (p-l)e 





50 100 , . 150 

frequency bins p 



FIG. 3: The expected value of \ 2 f° r a signal that is not 
matched to the template satisfies (x 2 ) = P — 1 + K ( z ) 2 where 
(z) is the expected SNR. If signal and template share the same 
set of frequency bands, then below the curve e — 2/p one has 
k < [p— l)e 2 . Above the curve k < 2e. If signal and template 
do not share the same frequency bands, then k < 2e is the 
only limit that applies. 



The upper bound on k is 



k < (p - 1) 



cos ( 



(6.32) 



where in the final part of the relation we assume as before 
that the mismatch f «1. This is one of the other main 
results of the paper. In the case where the bands Ab- 
used to calculate \ 2 are the same for the template and 
the actual signal, p < 2/e, and e is small, one has 



k=pY j u 2 . (6.27) 

3=1 

The values of u>j are constrained by two relations: 



(X 2 ) =P~ 1 + k(z} 2 with < K < (p-l)e 2 



.33) 



This result is beautifully consistent with the previous 
limit on (x 2 ). At the boundary of validity (|6~3U|) of |JS31Jl, 

one has 



p cos( 



If we assume that p > 2 and 



u>j — 0, and 

3 = 1 

-)< Uj <-(! + — -). 



cos( 



e = 1 — cos 8 < 



then the maximum of k is obtained when 



(6.28) 
(6.29) 

(6.30) 



Ll> 2 = 



P - I 'l-^), and 



P 



cos 9 



1 ( 1 

= (J p = - 1 

p \ cos 



(6.31) 



(P-1) 



1 



cos( 



-1 = 



1 



- 1 



cos z 



2e (6.34) 



which agrees exactly with the previous limit Ij6.22|l . The 
limits that apply are summarized in Figure Note that 
when the two different limits l|6.22ll and (|6.32|) are ex- 
pressed approximately to lowest order in terms of e, they 
appear to differ slightly at the boundary p = 2/e. In fact 
they agree exactly: the approximate expressions differ at 
higher order in e. 

In Appendix E] show that if the detector noise is Gaus- 
sian, then the x 2 statistic (computing with a perfectly 
matching template) has a classical ^-distribution with 
p — 1 degrees of freedom. If the template does not match 
perfectly, the distribution becomes a non-central x 2 dis- 
tribution, with a non-centrality parameter determined by 
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the r.m.s. value of i|6.15[) . which is n(z) 2 . This is dis- 
cussed in more detail at the end of Sect ion IVlTl It follows 
from the fact that the variance of the terms that appear 
in the definition of \ , 

((A Zi ) 2 )-(A^) 2 = i(l-i) (6.35) 

are independent of the signal amplitude. 



VII. SIGNAL OF UNKNOWN PHASE 

As mentioned earlier, the signal from an inspiraling 
binary pair is a linear combination of two possible grav- 
itational waveforms with an a-priori unknown phase <f>. 
Here, we repeat the analysis done in the previous three 
sections, for this particular case of interest. 

As in l|2.6[l . the detector output is assumed to be of 
the form 



s(t) = n(t) + - (cos <f)T c (t - t ) + sm(j,T s (t - to)), 



with n(t) a random time-series drawn from a distribu- 
tion appropriate to the detector noise, and both <fi and d 
unknown. We assume (as is the case for binary inspiral) 
that the templates are orthonormal so that 



(f c ,f c ) = (f s ,f s ) = 1, and 



(T C ,T S ) = 0. 



(7.1) 
(7.2) 



Note that in the stationary phase approximation, T c and 
T s are exactly orthogonal. Were they not, a Gram- 
Schmidt procedure could be used to construct an or- 
thonormal pair spanning the same space of signals |50) . 

There are several (easy) ways to efficiently search for 
the unknown phase <j). In substance, all of these methods 
consist of filtering separately with the two templates T c 
and T s , and then combining the two filtered data streams. 
For our purposes a nice way to do this is to combine these 
separate (real) filter outputs into a single complex signal. 
Thus, we use the optimal filter 



Q = (f c + if. 



-Imfta 



(7.3) 



Note that with this normalization the optimal filter is 
normalized so that (Q,Q) — 2. 

The output of the filter is complex and is 



z = [Q,s 



= ! Q. h S + ( Q, ^{cos<t>f c + sin^f s )e~ 2mft ° 



Its expectation value is the complex number 

D , ,\ D 



[z) = — (cos (f> + 1 sm ( 



j<t> 



(7.4) 



The modulus of this complex number is the (expected) 
inverse distance, and its phase is the (expected) phase. 
Note that because the normalization of Q has changed, 
the expected value 



(N 



(7.5) 



is larger than in the single phase case. The additional 
uncertainty about the phase <f> means that the distance 
to the source can not be determined as accurately as in 
the single phase case. Following conventional practice in 
the field, the modulus \z\ will be called the "Signal to 
Noise Ratio" (SNR) although since in the absence of a 
source its mean-square value is two, one might argue that 
\z\/y/2 is the quantity that should carry this name. 

To construct the x 2 statistic, we choose frequency 
bands as before. We will assume that T c and T s have 
identical fre quen cy bands and are orthogonal in each of 
these bands [51|. This is exactly true in the stationary- 
phase approximation where T s (f) = iT c (f) for / > and 
f s (f) = -if c (f) for / < 0. Thus 



(f c ,T c ). = (f.,2;)^-, and (7.6) 
(f c ,f s ). = 0. (7.7) 
We define the complex signal Zj in the j'th band as before 

and also define Azj as before 



A Zj = zj 



(7. 



One then finds 



(|A^f) 
The x 2 statistic is defined by 



1 D 

p d 


e^ cos (2 


2 


-(-) 


- + 

P 


P 2 \d) 


2 




- + 

P 




P 





p 



(7.9) 



(7.10) 



and thus from l|7.9|l has expected value 

( X 2 ) = 2p-2. (7.11) 

In Appendix[X]we show that if the detector noise is Gaus- 
sian, then x 2 has a classical x 2 probability distribution. 
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Because both the real and imaginary parts of Azj sum 
to zero, the number of (real) degrees of freedom is 2p—2. 

We now consider the case where the astrophysical 
waveform h(t) = %-T'(t) does not exactly match any 
linear combination of the templates T c and T s . This is 
to be expected from real signals if the templates form a 
discrete finite grid in parameter space. As before, with 
no loss of generality we assume that to — and that D' 
is chosen so that 

(f',f) = 1. 

Consider the possible values, as ip <G [0, 2ir) varies, of the 
inner product 

(cos V>T C + sin 4>f s ,f). 

Since both cosij)T c + simpT s and T' are unit length, 
Schwartz's inequality implies that this inner product lies 
in the range [—1, 1], and its maximum value must lie in 
the range [0, 1]. This maximum value (see Appendix lB)l 



cos9 = y (T C ,T') 2 + (T S ,T') 2 , (7.12) 

which defines 9 £ [0,7r/2]. This in turn defines the mis- 
match e = 1 — cos 9 between the template T" and the 
one-parameter family of templates. Note that (in con- 
trast to the single-phase case) the maximization over ip 
automatically leads to cos 9 g [0, 1] and hence e € [0, 1]. 
We can also define (f> G [0, 2ir) by 



cos <p cos 9 ee (f c ,f"), and 

sin (j) cos 9 ee (f s ,f). (7.13) 

(f c + if s , f) = e^ cos 9. (7.14) 



Thus one has 



This equation may be taken as the definition of <fi and 9. 
The filter output is given by 

z = (Q,s) = (Q,h + h) 

and thus has expectation value 

(z) = ^(f c + if s ,f) 



D' 



-e r cos( 



(7.15) 



The expected square modulus of the filter output is 

(|2|2> = 2+ (^) 2cOs20 = 2+l(z)|2 - 

We now investigate the effect of the template/ signal mis- 
match on the x 2 statistic. 



To begin, we need to characterize the overlap between 
the signal and the templates in the j'th frequency band. 
Define complex quantities Xj by 



(T c + iT s ,T'). = Aj-e^cos 



(7.16) 



Using 17.14|) , these complex quantities are constrained by 

f> = l. (7.17) 

i=i 

The filter output in the j'th frequency band is given by 

D' 

Zj = (T c + iT Sl n).+ — A,-e'* cos 9. 
'3d' 

and the various expectation values in the j'th band are 
, - D' 



-A ? e l<?> cost 
d! 



(M 2 > 

(z*z) 



D' 



+ I5rj l^l 2cos2 ' 



(|A< 



P 



| + (t) W» 



- COS 

p 



The expected value of \ 2 is then 

(X 2 ) =2p-2 + K \(z)\ 2 

with 



p 

3 = 1 



A, 



(7.18) 



(7.19) 



To place an upper limit on k, note that from Schwartz's 
inequality, for any value of the angle tp, one has |53| 

(cosVT c +sin^f s , f') 2 . < -(f',f'). 



(cos V (T c , f '). + sin V(T S) f') 



.) 2 < i(f',f').. 



The maximum value of the left-hand-side (see Ap- 
pendix [HJ is 



(f c ,ff. + (f s ,f') 2 .= 



(»[Aie**] cos 6>) 2 + (QfAj-e 1 *] cosfl)' 



|A 7 fcos 2 0, 



where we have made use of H7.13f) and (|7.14(1 . Thus 



|A -'' |2 -^oW (T '' T ^- 
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Summing both sides over j and making use of (|7.19|l this 
implies that 



150 



< K < 



1 



cos 2 6 



- 1. 



(7.20) 



This result makes no assumptions about the form of the 
mismatched signal T'. As in the single phase case, if we 
assume that the mismatch is small, £< 1, we obtain 



< k < 2e. 



(7.21) 



This result does not assume any relationship between the 
frequency bands of the signals T and T' . 

If we assume that the bands Afj for the mismatched 
signal T" are the same as those for the templates T c and 
T s then we can obtain a much stronger upper bound. For 
example, this is the case if all three templates are drawn 
from a family of stationary-phase approximate inspiral 
chirps. In this case, (T',T')j = 1/p and the same logic 
as in the single-phase case can be used to establish that 



K < (p - 1) 



1 



- 1 



(P-1)C 2 



(7.22) 



provided that p > 2 and e = 1 — cos 9 < 2/p. Thus, with 
some minor modifications, all the single- phase results ap- 
ply to the unknown phase case. 

In the case where the signal and template are not a 
perfect match, the expected value of Azj does not vanish: 

(Azj) = ^-e* cos (A, - -). 
d! v p 

If the detector noise is stationary and Gaussian, then for 
a given astrophysical signal T" and filter-template T c>s 
this means that the probability distribution of x 2 is a 
classical non-central x 2 distribution [54| with 2p — 2 de- 
grees of freedom and non-centrality parameter 

A= P V|(Az J )| 2 ^ K |(z)| 2 . 



Unfortunately, for a set of candidate events, which corre- 
spond to different waveforms D'T' /d! and ring off differ- 
ent templates T, the values of k have different, unknown 
values, bounded only by i|7.20|l or (|7.22|l . In this case, 
since the average of non-central x 2 distributions with dif- 
ferent values of A is not a non-central x 2 distribution, one 
can only bound the expected distribution, not determine 
it from first principles. 



VIII. THRESHOLDING CONDITIONS 



As described in Section the \ 2 time-frequency dis- 
criminator is most often used as a veto. For a given 
data set, the threshold value x* is usually determined us- 
ing Monte-Carlo simulation of signals, analytic guidance, 



100 



50 



LIGO S1 %\ = 40+0.1 5 Izl 2 




< x 2 >=14+0.06<lzl> 2 



10 20 

Expected SNR <|z|> 



30 



FIG. 4: A comparison of different thresholds %* f° r the \ 2 
time-frequency discriminator, for the LIGO SI two-phase case 
with p — 8, e — 0.03, and worst-case n = 2e. The bottom solid 
line shows the expected value of y 2 f° r stationary Gaussian 
noise given by l)8.1|l . The top solid line shows a threshold 
18. 411 set 4.9 a above this expected value. The upper dashed 
line shows the heuristic threshold l|8.8|l used for the LIGO 
SI analysis, which was determined from Monte-Carlo stud- 
ies. The lower dashed line shows the threshold which would 
be exceeded with probability 0.025% in stationary Gaussian 



and experience. If the signal and template were known 
to have identical form, then the threshold x* would be 
a number. However since the signal and template are 
not expected to match perfectly, the threshold x* 1S a 
function of the observed SNR. 

It is helpful to understand the threshold that would be 
appropriate for stationary Gaussian noise. In this case, 
the optimal threshold is given by the inverse of the non- 
central x 2 cumulative distribution function |55| . A puta- 
tive signal whose x 2 value is smaller than this "Gaussian 
noise" threshold is likely to merit further examination, 
even if the noise is not Gaussian . So in most cases a 
reasonable threshold will be greater than or equal to the 
threshold appropriate for Gaussian detector noise. 

In the stationary Gaussian case, for fixed T and T", 
the expected value, variance er 2 , and standard deviation 
a of the non-central x 2 distribution with 2p — 2 degrees 
of freedom are 40! 



(X 2 ) - 2p-2 + \ = 2p-2 + K \(z)\ 2 , 



(8.1) 



<r = ((x ) > - (X ) 2 =4p-4 + 4A, and 
a 



■Ak\(zW 



In the neighborhood of the maximum, for values of 
the non-centrality parameter A significantly larger than 
2p — 2, the non-central x 2 distribution is approximately 
a Gaussian of width a, centered about the mean value 
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2p — 2 + A. In this case, the optimal x 2 veto threshold 
for Gaussian noise is well-approximated by 

X* = (X 2 > + few a, 

where a are the expected statistical fluctuations in x 2 
evaluated for the "worst-case" value of k. If we assume 
that the putative signals and templates do not share the 
same frequency bands A/j, so that the upper limit of 
(17.201) applies, then we obtain a x 2 threshold of the form 



12000 



XI 



2p-2+2e|SNR| 2 +few ^/Ap - 4 + 8e|SNR| : 



where we have replaced the expected SNR by the mea- 
sured SNR j53. Although we have justified this approxi- 
mation to the threshold for large non-centrality parame- 
ter A, it turns out to be a reasonably good approximation 
even when the non-centrality parameter A is small |58j. 

It is interesting to compare the threshold appropriate 
for Gaussian noise to the x 2 threshold used in the LIGO 
SI analysis for p — 8 and e = 0.03, which is equation 
(4.7) of reference [| 



LIGO SI x 2 = 40 + 0.15|SNR|< 



(8.3) 



shown as the upper dashed line in Figure 0] The lower 
solid curve shows the expectation value of x 2 given by 
(|7.18|) . and the upper solid curve shows a 4.9 standard 
deviation threshold given by l|8.2|) . which is 



x: 



= 14 + 0.06|SNR| 2 + 4.9^28 + 0.24|SNR| 2 . (8.4) 



As is clear from the graph, the heuristic threshold is rea- 
sonably well matched by the sort of threshold that one 
might set based on a worst-case analysis for Gaussian 
detector noise. 

For very large SNR, the Gaussian threshold condi- 
tion H8.2f) consists of two terms. The dominant term 
(quadratic in SNR) comes from the mean value (x 2 ) and 
has coefficient exactly k. The sub-dominant term (linear 
in SNR) comes from a few times a. Hence, a threshold 
like the LIGO SI choice would not veto high SNR events 
that could be confidently vetoed in Gaussian noise. This 
is illustrated in Figure [5] 

In the LIGO SI analysis, which sets an upper limit 
on the Galactic inspiral rate, the probability of observ- 
ing a close inspiral (very large SNR) is far smaller than 
the probability of observing a more distant (low SNR) 
event from near the Galactic center. Thus, applying the 
more stringent thresholding condition at high SNR would 
probably not have had a significant detrimental effect on 
the analysis: it would not have significantly decreased 
the detection efficiency. However it also would not have 
improved the analysis, since the highest SNR events that 
passed the x 2 threshold had SNR less than 16. 



IX. UNEQUAL EXPECTED SNR FREQUENCY 
INTERVALS 

One may also define a x 2 time-frequency discrimina- 
tor using frequency intervals which do not make equal 
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FIG. 5: A comparison of different thresholds for the \ 2 time- 
frequency discriminator, at large SNR. The upper solid curve 
is the LIGO SI threshold 18.8fl . The lower (dashed) curve is 
the threshold that would be exceeded with probability 0.025% 
in stationary Gaussian noise. (On the scale of this graph 
the approximation 18. 4H to the dashed curve is difficult to 
distinguish from the exact result obtained from the inverse of 
the non-central \ 2 distribution.) 



expected contributions to the SNR, and almost all of the 
previous results hold |59j. For simplicity, in this Section 
we treat only the single-phase case. 

Begin by making the same initial assumptions as in 
Section IVII but choose frequency intervals which do not 
make equal expected contributions to the SNR. Thus 



choose Afj so that (f, T% 



9j, 



(9.1) 



where the qj S (0, 1] do not necessarily equal 1/p. Tem- 
plate normalization (T, T) = 1 implies that they satisfy 

Define the SNR in the j'th band as previously 



z j — (Qi s)j> 



(9.2) 



and define Azj as the difference between observed SNR 
in the j'th band, and the value that would be anticipated 
60] based on the total SNR observed. Thus we define 



Az., 



•3) 



where as before the observed SNR is z = XL 



(Q, 3) 



Note that by definition the sum . Azj vanishes. 

Consider the case of a signal waveform T" which may 
be mis-matched to the template T, so (T, T') = cos 9 
and (without loss of generality) < cos6> < 1. Within 
the j'th band, the template T 1 has overlap 



(f,f) . = A,- cos (9, 



(9.4) 
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which may be taken to define the quantities Xj. As be- 
fore, the sum J2j = !• Taking the signal to be 



D' 



»(*) = »(*) + ~f T 'W 



(9.5) 



we can now compute the various expectation values. 

It follows immediately from the above definitions that 
these expectation values are given by 



cos 9, 



jy 

D' 

— Xj cos 9, 
D' 

— (Xj -qj) cos6, 
l+f^Vcos 2 *, 



(ZjZf.) = <//);;, + j ^- j A, A; cos" «. 



D' 

— ) A 2 cos 2 0, and 
ct' ' J 



D' 



((Az,) 2 ) = qj (l - qj) + I — 1 (Aj - q 3 Y cos 2 9, 

(9.6) 

where (the Kronecker symbol) 5j k = 1 if j = k and van- 
ishes otherwise. The formulae of Section IVII correspond 
to the special case in which qj — I /p. 

Define the \ 2 time-frequency discriminator in the un- 
equal expected SNR interval case to be 



X 



i=i 



(9.7) 



To characterize the statistical properties of x 2 , it is help- 
ful to express it in terms of a different set of variables. 
Begin by writing % 2 as 



X 2 = ^2(z j -q j z)Az j /qj 

i=i 
p 



V \ 2 



(9.8) 



where we have made use of the fact that the Azj sum 
to zero. Define new variables Uj = Zj/^/qJ, which have 
variance unity and are uncorrelated: 

(ujUk) - (uj)(uk) = 5j k , 
In terms of these variables, the statistic is 



X 



P , P _ y 

E"' 

3=1 V j=l 7 



(9.9) 



To characterize the probability distribution of \ 2 j it is 
convenient to change variables once again. 

We introduce new variables Vj which are linear com- 
binations of the Uj and are most conveniently written in 
matrix form as 



(9.10) 



where M is a p x p square matrix. Choose M to be an 
orthogonal matrix, which thus satisfies M t M = MM 1 = 
J, where t denotes transpose and / is the p x p identity 
matrix. These linear transformations (|9.1U|) are rotations 
[6l] that map W W. 

Since the transformation is orthogonal, the new vari- 
ables Vj also have variance unity and are uncorrelated: 



~Vi 




~Ul 




= M 




-Vp . 




-Up . 



{VjVk) 



(Vj)(Vk) 

p p 

E E M je M km[(ueu m ) - (u e )(u m )] 

1=1 m—X 

E E MjeMkmhrn 
E M jm M km 



Moreover 



E^wm*)^ 

m 

{MM% k = I jk = 
p p 



0j k . 



(9.11) 



(9.12) 



since rotations do not change the length of a vector. 

The rotation M may be chosen so that any given or- 
thonormal basis is mapped onto any other orthonormal 
basis of the same orientation (handedness). Thus one 
may choose the rotation so that the last of the new vari- 
ables is 



(9.13) 



qiui 



92 u 2 



qpiip. 



This corresponds to constraining the final row of M to be 
• • • 7 \fq~n)i or equivalently to requiring that M map 
the vector on the l.h.s. below to the final (p'th) basis 
vector. 







"0 " 








M 
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In terms of these new variables, \ 2 given in l|9.9|l may be 
written using (|9.12|) and l|9.13[) as 



p 



(9.14) 
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This form makes it easy to characterize the statistics of 
X 2 if the detector noise is Gaussian 

If the noise is Gaussian, then the Uj are uncorrelated 
(and hence independent) Gaussian random variables with 
unit variance, and the Vk are also uncorrelated (and hence 
independent) Gaussian random variables with unit vari- 
ance. Thus, in the case of Gaussian noise it immediately 
follows from (|9.14|) that \ 2 has a classical non-central 
X 2 distribution with p — 1 degrees of freedom. In the 
two-phase case, each of the Vj is a complex variable with 
independent real and imaginary parts and the resulting 
distribution has 2p — 2 degrees of freedom. 

The non-centrality parameter A may be evaluated by 
calculating the expected value of x 2 ■ Using H9.6|) and 
(|?T7| this is 



and hence that 



(X 2 )=P-1 + K -=r 



D' 



cos 



= p — 1 + k(z) 



(9.15) 



and hence the non-centrality parameter is given by A = 
k(z) 2 . The constant k, which is determined by the 
choice of intervals, the spectrum of the detector noise, 
and the frequency-dependence of the mismatch between 
templates, is given by 



3=1 
p 

^(X 2 j /q j -2X j +q j ) 



j=i 



3=1 



(9.16) 



Clearly k is non-negative. One can easily obtain an upper 
limit on k even in this case where the frequency intervals 
are not "equal SNR" intervals. 

To obtain a limit on k, begin with Schwartz's inequal- 
ity, which implies that 



(T,ry. < (t,t).(t',t') 
\ 2 cos 2 e < qj (f',f') 



cos 



cosf 



(9.19) 



Without loss of generality, relabel the frequency intervals 
so that 

qi < q_2 < ■ ■ ■ < %■ 

The value of k is maximized by setting: 

Ai = 1 + (qi - l)/cos0 
A2 = q-il cosO 

X p = q p /cos6. (9.20) 

This choice satisfies the constraint that J2j ^3 — 1) an d 
the r.h.s. of (|9.19|l . In order that Ai satisfy the l.h.s. of 
the constraint l|9.19|l we need to have 



1 — cos 9 < 2qi 



(9.21) 



or equivalently e < 2qi. For the values of Xj given in 
(|9.2UI) one then obtains 

p 

3 = 1 

= ((gi -1)/ COS 61 + 1- qi) 2 /qi + 

/I ,\2 / 1 ,\2 



COS ( 



COS( 



[(Ql - l) 2 / qi + q 2 + ■ ■ ■ + q p ] - I? 



'cos 9 



qi "cosf 



For p > 2 this gives the upper bound on n. 

Denote the smallest value of qj (qi just above) by q m i n . 
Then, in the case where the templates T and T" have the 
same values of qj for a given set of frequency intervals, 
and 1 — cos 9 < 2q m { n one has 



^Il3 < 



1 



cos 



-aV Tl ' T ')y 



(9.17) 



Summing both sides over j and using H9.16f) yields 
< K < —-7T-: - 1, 



cos^ 



(9.18) 



and hence for e -C 1 one has < n < 2e, just as in the 
"equal SNR interval" case. 

One can also establish a stronger limit analogous to 
(|6.^2(l for the case where the templates T and T" have the 
same values of qj for a given set of frequency intervals. 
In this case, Schwartz's inequality implies that 

(f,f) 2 . < (f,f).(f',f). 
X 2 cos 2 9 < q 2 , 



, 1 \ / 1 - cos ( 

< K < [ 1 



(9.22) 



If e = 1 — cos 9 is much less than unity, then this may be 
written 



< k < 



1 



- 1 e 2 if e < 2q n 



(9.23) 



These reduce to the previous results of Section rVTl when 
all the qj (and hence q m in) equal \/p. 

This "unequal expected SNR" x 2 discriminator may be 
of practical use when it is impossible to construct equal 
SNR intervals. It may also permit the construction of 
discriminators which are specifically tuned to common 
types of detector noise. 
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X. OTHER TYPES OF X 2 TESTS 

There are many possible x 2 tests that could be used 
to discriminate spurious signals from genuine ones. Here 
we compare the x 2 time-frequency discriminator of this 
work with a standard x 2 test used by Baggio et. al. [36| . 
which we denote by \ 2 ■ This tests "goodness of fit" for a 
modeled signal embedded in stationary Gaussian noise, 
and is used in analyzing data from AURIGA, a narrow- 
band resonant-bar gravitational wave detector. 

It is instructive to express this standard test in the 
notation of this paper, x 2 is constructed in two steps. 
First one picks a time interval [t% , t\ + r] whose length 
t is not less than the time duration of the signal-model 
template T convolved with S 1 " 1 , and which includes the 
support of S- 1 *T 63]. 

Then one constructs a function of a single real ampli- 
tude A (or, potentially, additional parameters describing 
the signal) 

X 2 {A) = (a- AT, a- AT) = (a, a)-2A(a,f )+A 2 . (10.1) 

Here, s is computed from Q2.1[l . but the integral is taken 
only over the time interval [t\,t\ +t]. As a function of 
A, x 2 has an absolute minimum at 



.4 



(S,f) 



(10.2) 



The minimum value of ^(A) defines x 2 '- 

X 2 = (s,a) - (a,f) 2 = (n,n) - (n,f) 2 . 

Thus, x 2 measures the difference between the squared 
amplitude of the detector output and the squared SNR. It 
is clear from this equation that x 2 is quite different from 
the x 2 discriminator defined in this paper. In particular, 
if the detected SNR vanishes (z = 0) then x 2 = (s, s), 
whereas x 2 = Y^—y (s, T) 2 . In this case x 2 is measuring 
the "total length" of s, while x 2 is measuring the sum of 
squares of the components of s obtained by projecting it 
onto p orthonormal components of T. 

It is also instructive to compute the expected value of 
X 2 in our frequency-domain-based formalism |r3 |. De- 
note the instrument's data acquisition sample time by 
At, so that the Nyquist frequency is /jv = 1/(2 At) and 
the number of data samples is N = t/ At. After setting 
a(t) to zero outside of the time interval [t±,ti + r], one 
has 



(\n(f)\'' 



T cT 



o Jo 



dt / dt'(n(t + tjntf + ii))e 2,r4/ ( t "*') 



df / dt / dt' Snif)^-™-^ 



JO 

T pOC 



df / dt / dt' S n (f')e 27Tl( f-^ (t -^ 



df / dt S n (f)S(f - f')<?*M-f')* 



In going from the second to the third line, we have as- 
sumed that r is greater than the characteristic time over 
which the autocorrelation function of the noise falls off. 
From (110. 3|) it follows immediately that 

((n, »)) = { ^W- df - 2f N r = N, (10.4) 

J-fn a n\J) 

where, as before, N is the number of data samples. And 
provided that the interval \t\ ,t\ + r] includes the sup- 
port of the template T, we have already shown that 
((n,f) 2 ) = 1. Combining this with (|TTH|) and (|TTH|) 
one finds the expectation value 



(X 2 ) = N - 1, 



(10.5) 



dtS n (f) = rS n (f). 



(10.3) 



corresponding to the fact that % 2 has a classical x 2 dis- 
tribution with N — 1 degrees of freedom. 

As this analysis and counting makes clear, the defini- 
tion of x 2 given in 36] includes the degrees of freedom 
associated with every pixel in the time-frequency plane. 
In contrast to this, the x 2 time-frequency discriminator 
defined in this paper includes only blocks of pixels cen- 
tered along the time-frequency track of the template T. 
In fact, when x 2 is actually computed from data, the 
number of degrees of freedom is reduced to include only 
those degrees of freedom in the sensitive band of the de- 
tector. Ha 



XI. CONCLUSION 

This paper defines a x 2 time-frequency discriminator 
which is an effective veto for the output of a matched fil- 
ter. The statistic looks along the time/frequency track of 
purported signal to see if the SNR accumulates in a way 
that is consistent with the properties of the signal and 
the second-order statistics of the detector's noise. Small 
values of x 2 are consistent with the hypothesis that the 
observed SNR arose from a detector output which was 
a linear combination of Gaussian noise and the putative 
signal waveform. Large values of x 2 indicate that either 
the signal did not match the template, or that the detec- 
tor was producing very non-Gaussian noise. The method 
appears to work well for broadband detectors and signals, 
and may have wider applicability. 

The main results of the paper are the definitions of x 2 
given in 14.14|l and (|7.10|) . and equations l|6.24|l . I|6.33|l . 
and H7.18fl which give upper bounds on the expected 
value of x 2 if the signal and template are slightly mis- 
matched. We also showed that the x 2 time-frequency 
discriminator is distinct from the standard "goodness of 
fit" x 2 test described in 36]. 

Recently the TAMA group has been experimenting 
with using \z\ 2 /y 2 as a thresholding statistic for detec- 
tion purposes Q , where x? is X 2 divided by the number 
of degrees of freedom. In Monte-Carlo simulation studies, 
they have shown that this prevents simulated high SNR 
events from being rejected by the discriminator. This is 
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one way to accommodate mismatch between templates 
and signals. 

The construction of x 2 requires the (a-priori or pos- 
terior) choice of how many frequency bands to use. An 
outstanding research question is "what is the best way 
to set the value of p?" The correct answer to this ques- 
tion probably depends upon a number of factors. These 
include: 

• The ultimate goal of the analysis (i.e., setting upper 
limits, or detecting sources). 

• The statistical properties of the detector noise 
(both broadband background and transient 
glitches) . 

• The maximum mismatch e of the template bank. 

• The accuracy to which the putative signal wave- 
forms can be calculated or predicted. 

One possible answer comes from the behavior of y 2 as 
a function of the template mismatch e. We have shown 
that there are two possible types of behavior, depending 
upon whether or not the two templates have the same 
power spectrum (which implies that they share the same 
intrinsic frequency bands). In some situations, it may 
make sense to work along the boundary in the (p, e) plane 
that separates these two types of behavior, as shown in 
Figure 

Some interesting work on this topic has been done by 
Babak 11] who has found the optimal value of p for the 
GEO detector by studying the relative distributions of x 2 
in the presence and absence of simulated inspiral chirp 
signals. 

A related issue concerns the construction of a tem- 
plate bank. The minimum number of required templates 
is fixed by physics and the behavior of the detector: one 
divides the volume of parameter space by the volume 
covered per template . However within this constraint 
the actual locations of the templates and their precise pa- 
rameters are quite arbitrary. It may be possible to break 
this degeneracy by constructing a template bank in such 
a way that the effects on \ 2 °f a signal/template mis- 
match are minimized, or bounded significantly below the 
absolute limits that we have obtained. Roughly speaking 
this corresponds to placing the templates in such a way 
that the overlap (T, T')j is simultaneously maximized in 
each of the different bands j = 1, • • • ,p. This might also 
require varying the value of p as one moves across the 
template bank. 

While the x 2 test was specifically constructed for 
broadband signals, it may be generalized to signals that 
are normally thought of as "narrow-band" . One example 
is the Continuous Wave (CW) signals expected from a 
rapidly rotating neutron star (pulsar). In fact, these CW 
signals are not so "narrow-band" . Typically, the Earth's 
motion around the solar system modulates such a sig- 
nal by a part in 10 4 over a six- month-long observation. 
Since the intrinsic frequency of such a source is of order 



1 kHz, and the frequency resolution during six months is 
of order 10 -7 Hz, these signals are actually spread over 
approximately 10 6 frequency bins. Thus a x 2 test could 
be employed for such signals. 

In fact a corresponding x 2 test could be implemented 
in the time domain for any type of signal. In effect, one 
simply breaks the template (viewed as a function of time) 
into p contiguous and non-overlapping sections, each of 
which gives an equal expected contribution to the total 
SNR [6g. One then forms the % 2 statistic by seeing if 
these relative contributions are clustered around the ex- 
pected SNR (which is a fraction 1/p of the total SNR). 
Note that an analysis like the one done in this paper 
shows that this quantity does not have a classical % 2 dis- 
tribution if the detector noise is Gaussian and colored. 
This is because the noise in two non-overlapping time in- 
tervals is correlated. However if the length of the time in- 
tervals is long compared to the characteristic correlation 
time of the noise, or if the detector output and template 
are whitened, then the resulting quantity would have a 
classical x 2 distribution for Gaussian noise. 
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APPENDIX A: DISTRIBUTION OF X 2 FOR 
STATIONARY GAUSSIAN DETECTOR NOISE 

Here, we derive the probability distribution function 
(pdf) of the x 2 discriminator under the assumption that 
the detector's noise is stationary and Gaussian. For sim- 
plicity we treat the single-phase case; the two-phase case 
corresponds to replacing p and p — 1 by 2p and 2p — 2 
respectively. 

Since the different Zj are each constructed from dif- 
ferent, non-overlapping frequency bands, they are them- 
selves Gaussian random variables. Hence their pdf is 

p 

P( Zl , ■■■,z p ) = Y[(2na)- 1/2 c- [z i- a/p]2/2a (Al) 
j'=i 
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with <7 = 1/p and a — (z) . 

We need to calculate the pdf of Azj = Zj — z/p. This 
is complicated by the fact that these variables are cor- 
related [63 since their sum vanishes exactly. We denote 
the pdf of Azj by P(Azi, • • • , Az p ). It is defined by the 
relation that the integral of any function of p variables 
F(ui, • • • , Up) with respect to the measure defined by this 
probability distribution satisfies 



J dm ... / 



dvi 



du p P(ui, • • • , u p )F{u\, ■■■ ,u p ) = 
dv p P(vi, ■■■,v p )x (A2) 

We can use this definition to find a closed form expression 
for P. 

Let F(ux, ■ ■ ■ , u p ) = Y[j =1 S(v,j - Azj) in (|X2|) . One 
obtains 



P(Az u ---,Az p )= (A3) 

n / dv > {2na y/2 % - j - e ^ 



3=1 



3=1 



To evaluate the integral, change to new variables 
w%, ■ ■ ■ , uip-i, W defined by 



vi = W/p + w\ 



Vp-l 
V 



Vp-l- 



= W/p + Wp„i 
= W/p — wi — ■ ■ 

The Jacobian of this coordinate transformation is 
d(vi,---,v p ) 



(A4) 



J 



det 



det 



d(wi,-- ■ ,w p -i,W) 



1 


• ■ 


• 







1 • • 


• 


1/p 





• • 


• 1 


1/p 


-1 


-1 • • 


• -1 





(A5) 



Using the linearity in rows of the determinant, it is 
straightforward to show that J = 1 . 
The integral may now be written as 



P(Az u ---,Azp 



dwi 



dw. 



dW x 



(2na)~ p/2 e~ [{vi ^ a/p) +-+(v p -«/p) 1/2ct x 
6(wi — Azi) ■ ■ ■ S(wp-i — Az p _i) x 
5(wi + ■ ■ ■ + uip-x + Azp). 



(A6) 



The argument of the exponential may be expressed in 
terms of the new integration variables as 



(vi - a/p) 2 H \-(v p - a/pf 



(A7) 



w\ H h w 2 1 + (W- af/p + (ioH h Wp_i) 2 



- P /2 e -[Az 2 1 +---+Azl + (W-a) 2 /p]/2a 



and thus the integral yields 
P(Az lr --,Az p ) 
= J dW(2na 

S(Az! H h Azp) 

= (2 7 r ( x)-P/ 2 (27r ( 7p) 1 / 2 e-[ A2 i + -+ A ^/ 2ff x 

6{A Zl + --- + Azp). (A8) 

This pdf is easily visualized. In W it vanishes except on 
the (p — l)-planc Azi + • • • Az p = 0. On that hyperplane 
it is a spherically symmetric function of the distance from 
the origin. 

This probability distribution arises because we do not 
know the true expectation value (z) but can only estimate 
it using the single measured value of z. This issue arises 
whenever the mean of a distribution is not know but must 
be estimated (problem 14-7 of 41] ). This probability 
distribution is "as close as possible to a Gaussian" subject 
to the constraint that the sum of the ASj must vanish. It 
is significant that this pdf is completely independent of a, 
which means that if the detector noise is Gaussian then 
the properties of the Azj do not depend upon whether a 
signal is present or not. 

We can now compute the probability distribution of 

X 2 = p(Azf H Az 2 ) using (|X8l) . The probability that 

X 2 < Xo i s the integral of i|A8() inside a sphere of ra- 
dius Xo/y/P- To evaluate the integral, introduce a new 
set of coordinates (x%, • • • , x p ) on W obtained from any 
special orthogonal SO(p) transformation of the p coor- 
dinates ^Vp(Azi, • • • , Azp) for which the new p'th coordi- 
nate is orthogonal to the hyperplane Azi + ■ ■ ■ + Az p = 0. 
For example take x p = Az\ + • • • + Az p . Let r denote the 
radius from the origin r 2 = x\ + ■ ■ ■ + x 2 , and note that 
X 2 = r 2 . The probability is then 



P( X <Xv) - / P(Az 1 ,...,Azp)p-r/ 2 <Fx 

e- r2/2 8{x p ) d p x. 



r <Xo 

(2-KapY 



r<Xo 



The integral over the coordinate x p is trivial, yielding a 
spherically-symmetric integral over R p_1 : 

P(X < Xo) = (27rap)5-i f e~ r "' 2 d p ~ l x. (A9) 

J r<xo 

Since this is spherically symmetric, we can write the vol- 
ume element d p ~ l x = £l p _ 2 r p ~ 2 dr where Cl n = 

1 1 2 / 

is the n— volume of the unit-radius n— sphere S n . One 
then has 

rxo 

P{X < Xo) = (2TTap)2-%n p ^ 2 / r p - 2 e~ r I 2 dr. 

Jo 

Changing variables to u = r 2 /2 this takes the form 

1 fXo/2 



P(X < Xo) 



u 2 2 e du 
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~(E _ I 2d-) 
) V 2 2 ; 2 / 

r(f-i) 

which is the classical x 2 cumulative distribution for p—1 
real degrees of freedom, expressed in terms of the incom- 
plete 7-function. 

APPENDIX B: MAX OF A COS(V>) + B SIN(^) 

Twice in this paper, we require the maximum of 
f(ip) — Acostp + B sin ip, for fixed values of A and 



B, as ip varies in the interval [0, 2tt). This is trivial 
to obtain. Setting the derivative df /dip to zero gives 
Bcosrpo — Asintpo = 0, implying that at the maximum 
tan tpo = B / A. Thus one has sec 2 V>o = 1 + tan 2 ipo = 
l + B 2 /A 2 and hence cos^o = A/\J A 2 + B 2 and sin?/>o = 
B/\J A 2 + B 2 . Substituting these into / yields the max- 
imum value f(tpo) = yj A 2 + B 2 . 
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do not hold. 

[47] The word "anticipated" is used, rather than "expected" , 
because we do not experimentally or observationally have 
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5zj = Zj — (zj) — Zj — (z) /p. For any given observation or 
data set, the quantities Szj do not sum to zero (although 
their expectation value does). They have expected square 
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by Jolien Creighton. 
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is not exactly the Fourier transform of the time-domain 
T s . 

[51] Even if the templates are not orthogonal in each of the 
p bands, it is possible to apply the Gram-Schmidt pro- 
cedure in each of these bands separately. But the subse- 
quent template would no longer be an optimal filter for 
the desired signal. 

[52] In early versions of GRASP (prior to April 1998) the 
author mistakenly thought that this two-phase quan- 
tity could be calculated by substituting the template 
defined by the maximum-likelihood estimator of cj> into 
the known-phase formula 14.1411 . This was corrected in 
GRASP by Allen, Brady, and Creighton in June 1998. 
Note that if one fixes the value of <j) then for Gaussian 
noise one obtains a statistic with a \ 2 distribution and 
p — 1 degrees of freedom. But if <j> is estimated from the 
data then it depends (in a non-linear way) on the detec- 
tor noise, and so even for Gaussian noise, the resulting 
statistic does not have a x 2 distribution. 

[53] In these inequalities, the quantities that appear on both 
sides are real because T c , T s , and T' are all real in the 
time domain. Thus in the frequency domain they obey 
Z{f) = Z*(-f). Therefore if we let X and Y denote 
any of T c , T s , and T' , the nine possible inner products 
(x,Yj are all real. 

[54] The fact that template mismatch leads to a non-central 
\ 2 distribution for Gaussian detector noise was first 
pointed out by Jolien Creighton. 

[55] For example the Matlab function ncx2inv(r,2p-2,A), 
where r is the probability that a signal in Gaussian noise 
would lie below this threshold, for example, 99.9%. 

[56] The converse is not true: with non-Gaussian noise, true 
signals may well have x 2 values that exceed a reasonable 
threshold for the Gaussian case. 

[57] It is reasonable to make this replacement, since the case 
of interest is large SNR, and for this case the fractional 
statistical fluctuations in the SNR are small, so that 
SNR » (SNR). 

[58] An even better approximation can be found in |42| . This 
leads to a x 2 threshold of 

(X) + K4 - 1) + 2W&«X 2 > - b) 

where b = ^yi^2p-2 i ncreases smoothly over the range 
[1/2, 1] as the SNR increases, and x v is the correspond- 
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ing probability threshold for a zero mean unit variance 
Gaussian. 

Stanislav Babak pointed out that this case was of inter- 
est, because for practical reasons, it may not be possible 
to construct "equal SNR" intervals. 

As previously, the word "anticipated" is used, rather than 
"expected", because we do not experimentally or obser- 
vationally have access to the expected value of the SNR 
(z). In other words, the quantities Azj = Zj — qjZ are not 
the same as Szj — Zj — (Zj) = Zj — <lj{z}. For any given 
observation or data set, the quantities Szj do not sum to 
zero (although their expectation value does). They have 
expected square values {(Szj) 2 } — q 3 . 
The p(p — l)/2-dimensional orthogonal group O(p) con- 
sists of two disconnected components. In one of these 
components, the determinants of the matrices M are +1 
and in the other component the determinants are — 1. The 
first component contains the identity matrix I. Strictly 
speaking, only the matrices in the component connected 
to the identity may be thought of as pure rotations. Ma- 
trices in the other component are combinations of pure 
rotations and reflections that do not preserve the "hand- 
edness" or orientation of the basis. 

Another approach to showing that 19.71 has a non-central 

x 2 

as 



EE 

3=1 k=l 



fqk' 



where P is a p xp matrix. In the single-phase (two- phase) 
case, P may be taken to be real and symmetric (Hermi- 
tian). One need only show that P has rank p — 1 and 
is a projection operator, meaning that P 2 = P. This is 
equivalent to saying that P has one zero eigenvalue and 
that all its other eigenvalues are equal to one. This im- 
plies that an orthogonal (unitary) transformation can be 
found which puts P into block diagonal form with a one 
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[65] 
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[67] 



sub-block proportional to the p — 1 dimensional identity 
matrix and the other sub-block vanishing. It follows that 
for Gaussian detector noise \ 2 nas a classical non-central 
X 2 distribution with p — 1 (2p — 2) degrees of freedom. 
The choice of time interval is not explicitly discussed in 
|36| but the natural choice is the support of S^ 1 * T 
Reference [36] assumes that the detector noise is station- 
ary and Gaussian, and then constructs a new basis (not 
an orthogonal transformation of the old basis) which di- 
agonalizes the inverse correlation function Although 
the authors do not say so, if the noise is stationary then 
(Xij = f(i — j) is of Topelitz form and depends only upon 
i—j. One can then show that for large N the diagonal ba- 
sis is the frequency basis obtained via a Discrete Fourier 
Transform. So in effect |36| is done in a frequency basis. 
At the end of Section V and the beginning of Section VI 
of [3^ |. the authors describe how the signal was down- 
sampled and frequency shifted to reduce the number of 
degrees of freedom N. This corresponds to choosing re- 
gions of the time-frequency plane that have good overlap 
with the putative signal and expected sources of elec- 
tronic and instrumental noise. It may be possible to re- 
define x 2 m a wa y that automatically includes only the 
relevant regions of the time-frequency plane, and makes 
it invariant under transformations such as oversampling 
that should leave it invariant. 

For signals like inspiral chirp signals, which have mono- 
tonically increasing frequency as a function of time, the 
different contiguous frequency bands Afj are in one-to- 
one and monotonic correspondence with the adjacent re- 
gions of time. 

In an early version of the GRASP manual, this corre- 
lation was neglected and a x 2 distribution with p rather 
than p— 1 degrees of freedom was obtained. This mistake 
was pointed out by Jolien Creighton. 
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